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1. Introduction 

Let {Xt} be a covariance stationary process with spectral density 

f(u;)^\io\- 2d Mu;) ,ioe [-tt,tt] and d € (—1/2, 1/2) , (1) 

where /*(w) is continuous at zero, < /^(O) < oo and / /*(cj)du; < oo. To main- 
tain generality of the short-run dynamics, we do not impose a specific functional 
form on f±(uj). Equation ([I]) is referred to as a semiparametric model for f(co), 
specifying its form only near zero frequency. Time series with spectral density 
satisfy i ng (flT) can be observed in ma ny areas of applications; see for example 
Beranl (Il994 ). boukhan etaD (|2003h and the references therein. The process 



is said to have short-memory when d = 0, long-memory when d 6 (0,1/2) 
and negative (or intermediate) memory if d G (—1/2,0). Equation Q] is satisfied 
leading models for long and negative memory such as Fractional Autoregressive 
Integrated Moving Averages (FARIMA) and fractional noise. These, however, 
are parametric models, specifying f(uj) up to finitely many unknown parame- 
ters over all frequencies (— ir, The memory parameter d (like the scale pa- 
rameter /*(0)) is typically unknown and should be estimated. Many works have 
been devoted to the estimation of the memory parameter in the semiparametric 
context. O ne of the most popular is the or dinary least squares (OLS) estima- 
tor, due to Geweke Sz Porter-Hudak ( 19831 ) (GPH). This method employs the 



periodogram to obtain, through a regression equation, the OLS estimator of 
the memory parameter. Since the GPH estimator has been introduced, other 
variants of this method have been suggested with the aim of improving the 
quality of estimates and to achieve better asy mptotic sample properties; see 
Doukhan etHl (j2003h for an in depth survey and lNielsen Sz Frederiksen (j2005l ) 



for a detailed experimental study. 

In some areas of application, extremely long time series (hundreds of thou- 
sands to millions of samples are not uncommon, for example, in the analysis of 
teletrafic time-series or in high-frequency finance) has to be dealt with. Often, 
the analysis of these data has to be done on-line. Instead of computing the 
periodogram on the whole data set, a simple solution to the problem consists in 
partitioning the sample into subsets, referred to as epochs, computing the pe- 
riodogram over each epoch and averaging these to obtain an averaged version 
of the peri odogram. This l i ne of research has been pursued in the parametric 
context by Beran &: Terrin ( 19941 ) but has not, up to our best knowledge, been 



explored in the semiparametric context. 

In this contribution, we study the averaged periodogram spectral estimator, 
based on the division of series into epochs, to obtain the memory parameter 
estimate of a long- memory process. The estimation method follows the GPH 
procedure, where the periodogram is replaced by the averaged periodogram in 
the regression equation. Some desirable asymptotic properties of the proposed 
estimator are derived and empirical investigation gives evidence to support the 
use of the procedure as an alternative method to GPH to reduce the variance 
of the fractional memory parameter. 

All these topics are presented in the paper as follows: Section [2] outlines 
some properties of the averaged periodogram. Section [3] presents the proposed 
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estimator of the memory parameter d and discuss the statistical performance 
of the estimator. Section [J] presents the set-up of the Monte-Carlo experiments 
and assesses the finite sample properties of our estimator. 

Notations: In the paper, a = b denotes that a is defined as b, [a\ is the 
integer part of a and a A b = min(a, b). 



2. Some properties of the averaged periodogram of a fractional 
difference process 

Let {Xt}^ = i be a realization of the process Xt and N = gn, where g and n 
are integer values that correspond , respectively, to the number of epochs and 
the sample size of each epoch. The most natural tool for inference in the spec- 
tral domain is the periodogram, defined as the square modulus of the discrete 
Fourier transform (DFT). Instead of computing the periodogram over the whole 
dataset, the DFT and the periodogram of each individual epoch are obtained 
at the Fourier frequencies u k = 2nk/n, 1 < k < [n/2], i.e. for I = 0, . . . ,g — 1, 

n 

d i>n {uj k ) 4 (27m)- 1 / 2 £ X t+£n ^ , (2) 



t=i 



It,n{u k ) = \di>^{uJk)\ 2 = {2im)~ 



t=l 



(3) 



The averaged periodogram is then defined as the sample mean of the peri- 
odogram obtained on the successive epochs, 

Ig,n(Uk) = 9~ l I t,n{° J k) ■ (4) 

The averaged periodogram serves as the basis of the Welch estimator of the 
spectral density. Since in the sequel the spectral density / can be either zero 
(d < 0) or infinite (d > 0) at zero frequency, we have found more appropriate 
to state results using the normalized periodogram, i.e. the raw periodogram 
normalized by the inverse of the spectral density. For short-memory processes, 
i.e. d = in ([T|), it is well known that 

(i) the periodogram in each epoch is an asymptotically unbiased estimate of 
the spectral density, i.e. / -1 (u;fc)E[/o,n(a-'fc)] = 1 + 0(n _1 ), 1 < k < [n/2], 
where the 0(n _1 ) term is uniform with respect to ( w.r.t. ) the frequency 
coordinates k, 

(ii) the DFT ordinates in a single epoch at distinct Fourier frequencies are 
asymptotically uncorrelated, 

COV (/" 1 / 2 ( Wi )d0,nK), r 1/2 HH,nH)) = + 0{n~ l ) , 

l<j<k< [n/2], 
where the 0(n _1 ) term is uniform w.r.t. the frequency indices j, k. 
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(iii) the DFT ordinates over different epochs are asymptotically uncorrelated, 
i.e. for any I > 0, 

Cov (/ _1/2 (w;H,n(w;), r 1/2 KHnK)) =0(n~ 1 ) , 

l<j<k< [n/2], 

where the 0(n _1 ) term is uniform w.r.t. the frequency indices j, k and the 
epoch index I. 

Assuming that the number of epochs g is a fixed integer, the above results imply 
that 

(iv) the averaged periodogram is an asymptotically unbiased estimator of the 
spectral density, i.e. /~ 1 (cjfc)E[7 9jn (u;/ c )] = 1 + 0(n~ 1 ), where the (^(n" 1 ) 
term is uniform in k; 

(v) the averaged periodogram ordinates are asymptotically uncorrelated and 
its variance is equal to the square of the spectral density divided by the 
number of epochs, 

COV (f~ X (Uj)I g , n {Uj), f~ l {LOk)Ig,n{uk)) = 9~ l Sj,k + O^" 1 ). 

We now discuss som e periodogram properties for the process (pQ) wi th d ^ Gi- 
lt w as first shown by Kiinsch ( 19861 ) and exhaustively investigated by Hurvich Sz Beltrao 
in the single epoch case that the asymptotic behavior of low frequency 
DFT ordinates departs strongly from the weak dependence situation: in par- 
ticular, the DFT ordinates computed at Fourier frequencies ujj = 2tt j/n and 
u)t = 2nk/n for fixed positive integers j and k are correlated as n — > oo. Not 
surprisingly, the same departure from the weak dependence behavior is also ob- 
served in a multiple epochs scenario. The dependence in particular implies that 
the correlation among DFT coefficients over different epochs does not asymptot- 
ically vanish. More precisely, the following theorem shows that the correlation 
of the DFT coefficients evaluated at Fourier frequencies ujj and ui^, for fixed j 
and k, computed over different epochs does not vanish as n — > oo. 

Theorem 1. Assume that the spectral density f is given by ([I]). Let I > and 

1 < J ' < k be fixed positive integers. Then, 



(I 



where 



hm ufu$E [do,n("iHnH)] = Dx(^3,k,£) (5) 

hm ^4ndo,n{^)di,n^k)] = ( ^^^ D 2 (d;j,k,£) . (6) 

n^oo ■> ZTT 



/oo 
\uj\~ 2d A{oj - 2nj)A(2nk - uj)e~ Uul duj , (7) 
-oo 

/oo 
\uj\~ 2d A(io - 2irj)A(-2Trk - ujje'^du (8) 
-oo 



with A(w) = (e iw - l)/(iw). 
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Proof. See Section [5l □ 



Th eorem [J combined with the results of Theorem 5 given in lHurvich fc Beltrao 



<|l993h implies that the averaged periodogram is an asymptotically biased esti 



mate of the spectral density, i.e. limn.+oo f~ l {ojj)I g ^ n {ujj) / 1 for any given j, 
1 < j < A;, and the correlation of the averaged periodogram ordinates does not 
asymptotically vanish, i.e. 

for any given j and k. Nevertheless, under appropriate regularity condition for 
the spectral density of the short-memory process it can established that 
there exist two sequences {r(f;k)} and {r(f;j,k)} satisfying, for all n and all 
l<j<k< [n/2], 

\E[I g>n {u k )/f(u k )]-l\ <r(f;k) , 
\^oy{f~ l {ujj)Ig y n{^j)J~ l {^k)Ig,n{^k)\ <r(f;g,k,j) , 

such that 

• linifc^oo r(/; A;) = 0, which means that the bias is small for frequencies 
sufficiently far away from zero, and 

• for any sequence m n such that lim n ^oo m n = oo and lim ra _» 00 m n /n < 1/2, 
Y,i<j<k<m n r (,f'>9,j,k) = O (log r (m n )), for some r > 0. This shows that 
whereas the dependence does not vanish it is small when the Fourier 
frequencies are sufficiently far apart. It is also possible (see the results 
below) to get bounds of this quantity as a function of g. 

To derive the above results, some regularity assumptions need to be imposed on 
the spectral density of t he short-memory process. Consider the s et of functions , 
whic h is adapted from (Robinsonl . 1995 . Theorem 2) ( see also ( Soulier . 200 ll ) 



and (jMoulines fc Soulier! . l200^ vT 



Definition 1. For fj, > 1 and v £ (0,1]. Let £^(^,i/) be the set of functions 
4> : [— vr, 7r] — ► M + satisfying for all uj,uj' G [— vr, ir] \ {0} 

max 0(w) < ^(0) , (9) 

7T,7rJ 

\<(>(u;)-4>{u/)\<t^(0) } U ~fl , (10) 

\<j>(u) - 0(w')| < A*</>(0) ||c;| - \lo'\\ u , (11) 

The set of functions £*(/U, u) contains all the functions which are strictly posi- 
tive and continuously differentiable on [— vr, n]. In particular, if /* is the spectral 
density of an stationary and invertible ARMA process, then /* £ 1). More 

interestingly, it also contains functions of the form /*(w) = f i ,(uj) J i-a 2 \uj\' y , where 
/* is a strictly positive continuously differentiable function. Note that such spec- 
tral density appears in the so-called signal-plus-noise model, where a fractional 
process with smooth spectral density /* is observed in presence of white noise, 
uncorrelated from the process. 



imsart-generic ver. 2007/09/18 file: REisenFrancoMolinesVl.tex date: February 2, 2008 



/Log-average periodogram estimator 



(> 



Theorem 2. Let 5-, 5+ £ [0, 1/2), fi > 1 ,anc? f £ (0, 1] &e constants. Assume 
that f is given by (pTJ) with f+ E and d £ [— <5_,<5 + ]. Then, there exists 

a constant C (depending on the constants 5-, 5+ and fi) such that, for any 
1 < j < k < ra/4 and l> 0, 



E[/- 1 (a; fc )Io,„(a; fe )] - 1 < Clog(l + k)/k , 



(12) 



/ 1/2 (LOj)f 1/2 (uj k ) \E[d ,n{vj)de,n(uk)] + ®[do,n(uj)de, n (uk)]\ 

< Clog(k)j~^k^~ 1 fa-« 1_2t 9 A1 > + {in)- v \ . (13) 

Proof. See section [U □ 

It is worthwhile to note that the dependence among successive epochs does 
not asymptotically vanish as n — > oo, in strong contrast with the short-memory 
case. Also, the strength of the dependence among the epochs depends on the 
memory coefficient d. 

Using Corollary 2.1 in ISoulierl l|200ll ) and under the additional assumption 



that the process is Gaussian, it is possible to translate the results above to non- 
linear transforms of the DFT ordinates, for instance to the "log" function of the 
average periodogram. These are presented in the following corollary which lead 
to results that provide a theoretical justification for the estimator proposed in 
this paper. Some additional notations are required to state the results. Let U be 
a central chi-square with 2g degrees of freedom. Then, E [log ( £7/2)] = tjj(g) and 



VarJlog(C//2)] = ip'(g) where is the digamma function (see ([Johnson fc Kota . 
1970, P- 198)). For instance, ^(1) = —7, where 7 is the Euler constant and 
ip'(l) = 7r 2 /6. It is well-known that lim^oo gijj'(g) = 1. Hence, for large g, 
Var[log(f//2)] = 0{g~ l ). Let 

in,k = log [Ig,nM] ~ log [/*(£<;*)] " VG?) + log( 5 ) • (14) 

The following corollary establishes the statistical properties of £ n k- 

Corollary 3. Assume that Xt is a Gaussian process. Then, there exists an 
integer K and a constant C (depending only on the constants 5-, 5+, \x, and 
K) such that, for any K < j < k < [(n — l)/2] 

|E[e„j]i <cio g (i+ j)/j , 

|Var[£nj]-^(0)| <Clog 2 (l+j)/i 2 , 

|cov(e„j,en,fc)i < c\o g \k)r 2 \ d \k 2 \ d \- 2 . 

Proof. See Section [H □ 



3. Estimation of the memory parameter based on log-periodogram 
regression 

As an application of the results obtained above, it is argued in this section 
that the averaged periodogram is a simple mean to reduce the variance of 
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semiparametric estimator of the memory parameter based on log-periodogram 
regression. For simplic ity, in this contribution the foc us will be on the GPH 
estimator proposed bylGeweke &: Porter-Hudak dl983h (GPH) and further an- 
alyzed bv iRobinsonl (119951 ) and iHurvich et al.1 (|l998h . The same reduction of 
variance holds for the bia s redu ced log-periodogram estimator introduced by 
Andrews &i Guggenberge 2 l|2003h . which is based on regression of log .f+fu;) by 



an eve n polynomial of degree 2r, and for the estimator introduced bv lGuggenberger fc Sun 



which is obtained by taking a weighted average of GPH estimators over 
different bandwidths. 

The GPH estimator is the ordinary least square (OLS) regression estima- 
tor obtained from an approximated regression equation of the logarithm of the 
spectral density (JT]), having the logarithm of the spectral density as the depen- 
dent variable and log(w) as the independent variable. Taking the logarithm of 
([T]), the log-spectral density can be expressed as 



log f(u) = log /*(0) - 2dlog(u;) + log 



(15) 



For the proposed estimator, the spectral density f{ui) is replaced by the aver- 
aged periodogram I g>n (uj), and using the decomposition (fT4"j) . an estimate of d 
is obtained from the regression equation 



log [I g ,n(wk)] =ao~ 2dlog{uj k ) + log [/*(wfc)//*(0)] + £ 



n,k 



(16) 



where the intercept is ao = log/*(0) + ip{g) — log(g) and the random variables 
{£n,fc} are defined in Corollary [3j The GPH estimate of the memory parameter 
d is thus given by 



d„ 



a k,n( 

k=l 



m. 



log [lg,n{Uk)] 



(17) 



where {m n }, the bandwidth in the regression equation (|16f) . is a sequence of 
integers and the weights ak{m n ) are given by 



ak,n( m n) 



[-21og(o, fc )] - m- 1 E J m =i[-21ogK)] 
ET=i {[-21og(u; fc )] - mn 1 Ep^log^)}}' 



(18) 



We will now derive a central limit theorem for the above estimator. To do this, it 
is required to state some addition al regularity conditio ns of the spectral density 
of the short-memory process; see iGiraitis et all (120001 ). 

Theorem 4. Assume that {X{\ is a Gaussian process with spectral density f 
satisfying ([1]) with f+ G £^(/.i,/3) for some fi < oo and (3 G (0, 1] and 

for all uj G [-O , Qq], - /*(0)| < ^(O)^^ . 

Let {m n } be a non- decreasing sequence of integers such that 

lim (m" 1 + mf +l n- 2f3 ) = . 



(19) 



Then ^m n (d mn g — d) is asymptotically distributed as Gaussian with zero-mean 
and variance ip'(g)/4:. 
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Using similar arguments as those given in Hurvich et al. ( 19981 ). the bias and 
variance of d mn „ are computed by assuming that /* is three times differentiable 
in a neighborhood of the zero frequency. Using once again Corollary [3l 



E 



m n ,g 



d= Yl a j,n( m n) l°g f*(Uj) + a jA m n)^[£n,j] (20) 
3=1 3=1 



where £ n is defined in Q14]) and 



j=l k=lj=k+l 

(21) 



Along the same lines as iHurvich et al.l ()1998l ) (Lemma 1 to Lemma 8), we 
establish an explicit expression for the mean-square error (MSE) of the proposed 
estimator. 

Theorem 5. Assume that f+ G and satisfies the conditions /*(0) = 0, 

\f"(uj)\ < oo and \ f"'(ui)\ < oo for any to G [— Uq,Uq] where G (0, it]. Then, 



E[d. 



m n ,g 



d] 



2vH^(0)m, 



9WW 



log 3 (m n ) 



and 



Var ( d m„, 9 ) 



V>'(g) 
4m„ 



1 

m r 



(22) 



(23) 



Neglecting the remainder terms in the bias and variance, and assuming that 
f"(0) 7^ minimizing the approximate expression for the MSE, i.e. 



MSE(n,g) 



9/*(0)n 2 



+ 



4m„ 



(24) 



with respect to the bandwidth parameter m n for a given number of epochs g 
yields the asymptotically optimal choice for the bandwidth m n (g) 



m n {g) 



1/5 



n 



4/5 



(25) 



where = (4/81)vr 4 {/* (0)//*(0)} 2 . With this choice for m n (g), the optimal 
value for the mean-square error is 



MSE(n, 5 ) = C4^( 5 )} 4 / 5 n- 4 / 5 , 



(26) 



where C* = \{IQ)~^B^ 2/5 + (16B*) 1 / 5 / 4 }- We will now discuss the poten- 
tial advantages in performance obtained by dividing the series into epochs. 
The optimal MSE of the classical GPH (using a single epoch) is given by 

MSE(iv, i) = aw(i)} 4/5 iv- 4/5 . 
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Dividing the series into g epochs each of size n = N/g, the optimal MSE is 
given by 

MSE(N/g,g) = C^' {g)} A '\N / g)^ = C,{g^(g)}^N^ . (27) 

Since g t— > gip'(g) is a decreasing function, the optimal MSE is also a de- 
creasing function of the number of epochs, which means that dividing the 
series into epochs is a very simple way to improve the MSE. The quantity 
gip'(g) decreases from 7r 2 /6 to 1 and as g goes to infinity (more precisely, 
gif/(g) = l + l/(2g) + 0(g- 2 )). For g = 3, gif)'(g) is 1.1848 and its value changes 
slowly thereafter, as it is shown in the following tabulation. 



m 


1 


2 


3 


4 


5 


6 


7 


8 


mip'(m) 


1.646 


1.290 


1.185 


1.138 


1.108 


1.090 


1.080 


1.070 



4. Monte-Carlo results 

This section provides a limited Monte Carlo experiment to support our claims. 
For this purpose, realizations of a Gaussian white noise sequence et, t = 1, ■ ■ ■ , n, 
with unit variance, were generated by IMSL-FORTRAN subroutine DRNNOR 
and trajectories of Gaussian processes {X{\ with spectr al density satisf ying 



dU) were simulated according to the procedure outlined by iHoskingi (|198ll ). To 
assess the performance of d mn:g , we compute the bias, the mean-square error 
(mse) and the coverage rates (cr) of the asymptotic confidence interval based 
on the normal distribution (see Theorem ([!])). The quantities were calculated 
based on 2,000 replications for different sample sizes N and number of epochs 
g. The results are displayed in Tables [I] to [3j In each experiment, the sampling 
distribution for the standardized d mnt9 estimator was calculated to obtain the 
coverage rate, which refers to the percentage of cases where the true value of d 
(d = 0.3) lies inside the 95% asymptotic confidence interval (d mnj g ± 1.96o- e>n ), 
where a 2 n is the asymptotic variance. We use two different approximations of 
the asymptotic variances, which are asymptotically equal but different for finite 
sample size. The variances are o\ n = ip' '(g) / '4m n (which is the limiting variance 
in Theorem 0|) and 



2 



E]T=i {[-21og(« fc )] - mn 1 E£i[-21og( Wi )]}' 



which is the variance of the regression obtained as in the case where the av- 
eraged periodogram ordinates are independent with equal variance ip'(g). For 
the ARFIMA process with short-memory parameters, the bandwidth m n is the 
one given in the previous section (m n (g)) that minimizes the asymptotic mean- 
square error of the estimator; we assume here that the parameters specifying 
the short-memory component /* of the ARFIMA model are known, to avoid 
discussing the separate issue of the optimal choice of the bandwidth (this may 
be seen as an oracle estimator in this semiparametric context). In the case where 
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/*(w) = 1, short-memory dynamics are not present, hence two large and fixed 
bandwidths were used, mi = n - 7 and 777-2 = These bandwidth choices 

were considered with the aim to verify the finite property of the bandwidth on 
the estimates and the convergence of the standardized estimator to the normal 
distribution. 

Results from Table [T] support the asymptotic properties discussed in the 
previous sections for the ARFIMA(0, d, 0) model. As it can be observed, the 
mean-square error decreases as the number of epochs g increases. Breaking the 
series in a fixed number of epochs can produce a significant reduction in the 
mean-square error (thi s effec t is similar to the pooling in frequency domain 
advocated in lRobinsonl(jl995l ^. Although large g does not bring too much gain 
in terms of the mean-square error, it does not cause penalty in the estimates 
unless it reaches very large value, as it has been already discussed in the previous 
section. 

Even if the sample sizes used in this limited Monte-Carlo experiments are 
not large, the coverage rates of the asymptotic confidence intervals are reason- 
ably accurate. The bandwidth m\ produces estimates which are (as expected) 
less accurate than 7712. For both bandwidths, the coverage of the asymptotic 
confidence intervals based on o> jn are precise, even for relatively small sample 
sizes. The coverage rates of the standardized estimators using the asymptotic 
standard deviation (cr a ) is reduced as g increases. This indicates that the con- 
vergence of d mn: g , standardized by a a , to the N(0,1) is affected by the sample 
size reduction. 

Table 1 
ARFIMA(0,d,0) 

d = 0.3 



N 


9 


mi 


mean 


mse 


C7V 


cr a 


7712 


mean 


mse 


C7Y 


cr a 


512 


1 


78 


0.3032 


0.00681 


94.7 


92.1 


255 


0.3035 


0.00222 


96.0 


97.9 




2 


48 


0.3068 


0.00443 


95.7 


91.5 


127 


0.3051 


0.00181 


95.2 


95.6 




4 


29 


0.3073 


0.00371 


95.2 


87.4 


63 


0.3034 


0.00193 


94.0 


91.1 


2048 


1 


207 


0.3004 


0.00215 


95.2 


94.6 


1023 


0.3006 


0.00052 


94.6 


98.6 




2 


128 


0.3027 


0.00148 


95.8 


92.7 


511 


0.3007 


0.00041 


94.8 


98.1 




4 


78 


0.3032 


0.00109 


96.2 


93.1 


255 


0.3027 


0.00036 


96.0 


97.4 




8 


48 


0.3034 


0.00096 


94.5 


89.6 


127 


0.3022 


0.00039 


94.8 


95.9 




16 


29 


0.3043 


0.00084 


95.7 


88.8 


63 


0.3047 


0.00046 


94.1 


90.7 


8192 


1 


548 


0.3007 


0.00078 


94.5 


94.3 


4095 


0.3007 


0.00013 


95.2 


99.6 




2 


337 


0.3021 


0.00054 


95.6 


94.4 


2047 


0.3006 


0.00010 


94.9 


99.4 




4 


207 


0.3016 


0.00039 


94.3 


93.0 


1023 


0.3008 


0.00009 


94.9 


99.0 




8 


128 


0.3036 


0.00032 


94.1 


92.3 


511 


0.3016 


0.00009 


94.3 


98.3 




16 


78 


0.3034 


0.00027 


94.2 


90.3 


255 


0.3021 


0.00009 


94.2 


97.3 



Table [2] gives the results for the ARFIMA process when non-trivial short- 
memory components are present in the model. As it has often been reported 
in the long-memory literature, the short-memory component causes significant 
bias in the estimator of the long-me mory parameter, esp e cially i f the bandwidth 
i s not prope rly set; see, fo r exam ple, Hurvich fe Beltraa ( 19941 ). Hurvich Sz Ray 
(|l995h and iReisen et al.l d200lh among others. As shown by the asymptotic 
analysis, the bias of the estimator is not significantly affected by the division of 
the sample into epochs, i.e. at least when the number of epochs g is not large 
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compared to N. This experiment shows that the improvement of the estimates 
in terms of mean-square error depends on the sample size N and the number 
g of epochs. For all decrease of the mean-square error is observed 

when taking g = 2,3, but the improvement becomes marginal when g > 4 . 
This empirical property of the estimator is not surprising and it was justified 
theoretically in the previous section. For example, for g = 2 the decrease of 
the mse predicted from the asymptotic expression (|27p when using the optimal 
bandwidth is 0.783, which is consistent with the values found in the Monte-Carlo 
experiments. The coverage rate is stable when using the regression variance in 
the standardized estimator, by the other hand the values of cr a are reduced as 
g increases, which is presumable due to the sample size reduction. 



Table 2 

Estimation in ARFIMA(l,d,0) model using the optimal bandwidth 







d 


= 0.3, <j> = 


-0.3 






d 


= 0.3, (j> 


= 0.3 






N 


9 


m n 


mean 


mse 


cr r 


cr a 


m n 


mean 


mse 




cr a 


512 


1 


103 


0.26682 


0.00574 


92.8 


90.2 


62 


0.3440 


0.01083 


91.7 


87.3 




2 


49 


0.26973 


0.00511 


92.7 


88.6 


29 


0.3445 


0.01049 


91.8 


83.3 




4 


23 


0.27282 


0.00572 


92.7 


84.9 


14 


0.3517 


0.01252 


91.6 


76.4 


2048 


1 


312 


0.2824 


0.00175 


93.2 


91.6 


190 


0.3247 


0.00299 


92.5 


91.1 




2 


148 


0.2856 


0.00151 


92.6 


90.5 


90 


0.3256 


0.00275 


91.6 


88.6 




1 


72 


0.2847 


0.00148 


93.1 


89.1 


44 


0.3272 


0.00290 


91.1 


85.0 




8 


35 


0.2852 


0.00153 


93.2 


87.1 


21 


0.3283 


0.00346 


89.9 


80.2 




16 


29 


0.2864 


0.00186 


93.9 


83.9 


10 


0.3349 


0.00458 


91.2 


73.2 


8192 


1 


947 


0.2905 


0.00053 


93.1 


92.7 


577 


0.3135 


0.00091 


93.1 


91.6 




2 


451 


0.2919 


0.00044 


93.4 


92.4 


275 


0.3146 


0.00084 


90.7 


89.3 




4 


219 


0.2925 


0.00041 


92.9 


91.5 


134 


0.3158 


0.00082 


90.7 


88.3 




8 


108 


0.2932 


0.00041 


93.6 


91.2 


66 


0.3155 


0.00089 


90.7 


86.0 




16 


53 


0.2939 


0.00043 


93.0 


88.9 


32 


0.3184 


0.00106 


89.9 


82.2 



The optimal bandwidth depends on the parameters of the model, thus it is 
not possible to obtain this quantity in practical situations. Indeed, it is well 
known that the semiparametric estimators are bandwidth driven estimation 
procedures. Due to these peculiarities, an empirical investigation was considered 
for the bandwidth m n = (y ) ' 5 , and the results are in[3l It is not surprising that 
the use of this bandwidth produces estimates with larger mean-square errors. 
However, there is an empirical evidence that the reduction of the mean-square 
error can be obtained even for g > 4. In addition, the cry is more accurate than 
the previous case, which can be explained by the fact that the reduction in 
the number of the periodogram ordinates in the regression mitigates the effect 
of the AR coefficient. The reduction of the impact of the AR coefficient for 
this choice of the bandwidth is also manifested by the similarity between the 
estimates of the two models. 
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Table 3 

Estimation in ARFIMA(l,d,0) model using m n = ( — ) 







d 


= 0.3, (f> 


= -0.3 






d 


= 0.3, <f> 


= 0.3 






N 


g 




mean 


mse 


cr r 


cr a 


m n 


mean 


mse 


C7Y 


cr a 


512 


1 


22 


0.3016 


0.02873 


95.0 


88.7 


22 


0.3116 


0.02782 


96.0 


90.0 




2 


16 


0.3039 


0.01656 


95.8 


87.1 


16 


0.3205 


0.01781 


94.6 


86.2 




4 


11 


0.2916 


0.01388 


94.5 


81.8 


11 


0.3336 


0.01441 


93.6 


80.3 


2048 


1 


45 


0.3008 


0.01197 


95.8 


91.4 


45 


0.3053 


0.01185 


94.8 


92.2 




2 


32 


0.3075 


0.00714 


95.0 


90.6 


32 


0.3064 


0.00748 


94.4 


89.2 




4 


22 


0.3064 


0.00470 


95.3 


88.4 


22 


0.3117 


0.00495 


95.6 


86.9 




8 


16 


0.2999 


0.00337 


96.0 


87.2 


16 


0.3216 


0.00414 


93.2 


82.8 




16 


11 


0.2926 


0.00305 


94.7 


83.9 


11 


0.3390 


0.00455 


88.9 


72.6 


8192 


1 


90 


0.3066 


0.00573 


94.7 


91.9 


90 


0.3076 


0.00536 


95.0 


93.5 




2 


64 


0.3057 


0.00316 


94.8 


91.7 


64 


0.3014 


0.00312 


95.4 


92.5 




4 


45 


0.3051 


0.00217 


94.4 


90.6 


45 


0.3057 


0.00212 


95.4 


91.2 




8 


32 


0.3046 


0.00158 


94.3 


90.1 


32 


0.3061 


0.00158 


94.0 


89.5 




16 


22 


0.3045 


0.00122 


94.3 


85.0 


22 


0.3141 


0.00140 


92.4 


83.5 
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5. Proof of Theorem [T] 

Denote by 

D n {u) = e iw (l - e in ")/(l - O (28) 
the Dirichlet kernel. Straightforward calculations show that 

EK,o(wi)4,«k)] = r f(Lv)E ntjtk (u)e- iinuj duj , (29) 

J — n 

where 

E n,j,k(w) — (2irny 1 D n (uJ - ojj)D n (oJ - LO k ) ■ (30) 

The proof then follows from the change of variable lo — > ulo, using the dominated 
convergence theorem. 



6. Proof of Theorem [2] 

We preface the proof by two technical lemmas, which are used in the sequel. 
Throughout this section, C is a constant, depending only on fj,,u, 5_ and 5 + , 
but which may take different values upon each appearance. 

Lemma 6. There exists a constant C (depending only on 5-, 6+, fi) such that, 
for any d £ [—5-, <5+], /* G v), and uj,uj' G [—it, n] \ {0}, 



- 2d U(cu) - \uT 2d h{J) < C/*(0)(M A l^'l)- 1 - 2 ^ - J\ . (31) 



imsart-generic ver. 2007/09/18 file: REisenFrancoMolinesVl.tex date: February 2, 2008 



/Log-average periodogram estimator 



13 



In addition, there exists a constant C , depending only on [i and v such that, for 
all integers I > 1 and j 6 {1, . . . , n/2}, 



\f(u)-f(u + ir/en)\du 

CU0)[{£n)- 1 {(£n) 2d + (j/n)- 2d } + (£n)- v (j/n) 1 - 2d ] . (32) 



< 



Proof. The proof of (|3ip is obvious and is omitted for brevity. Note first that, 
for any uj,u' G [— it, it], 



, |-2d_, /,-2d 



2,/ 



|/H-/(wOI<c/*(o){ 

Applying this inequality with u/ = u> + 7r/£n, yields to 
\f(oj)-f{oj + ir/£n)\doj 



+ M M — w 



'i r} • (33) 



< C/*(0) 

= ca(o) 



OJ 



OJ 



-2d 



-2d 



u + 7r/£n\- M duo + (ln)- v \ \uo\- M duj 



\uj + ir/ln\- 2d du + (1 - 2d) -1 (&i) -, >j 



i/i, , 1 1— 2d 



On the intervals [— ujj, —2ir/£n] and [ir/£n,ujj], we use the bound 

< C\d\ {|wr 1_M + |w + vr/^r 1 "^} (M" 1 



which yields 

-2-n/in /-u; 



+ 



j 

it j in 



\u\~ 2d - \u + 7r/£n\~ 2d \ dio<C [{ln) 2d + n 2d r 2d } {In)" 1 . 



On the interval [—2ir/£n,iT/£n], we use the bound 



-2d 



\uj + 7r/ln)\- 2d \ <C{\u>\ 



-2d 



+ \i0 + 7T 



//n|" M } , 



which yields 



u\~ M + Tr/£n)\- M doj < C(l - 2d)" i (£n) 

-7r/2fa 

which concludes the proof. 
Define 

A £>n (w) 4 (27m)- 1 { D n (u; + vr/£n) - D n (u;)} , 
where Z) n is the Dirichlet kernel defined in (|28p . 



□ 

(34) 



Lemma 7. There exists a constant C such that, for all n,£ > 1 and u; suc/i 
i/iai < a; < u; + 7r/ (in) < tt, 



\Ai, n (cj)\ <cr\\ + n\u\y l 



(35) 
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Proof. For any uj,uj' £ (0, it], 



\D n {u) - D n {u')\ 



k=l Ju) k=l 



< 



d\<C 



n 



1 + nX 



dX 



k e ikX 

k=l 

= Cn |log(l + noj 1 ) — log(l + nui)\ . 
Thus, if < u < to + it /{In) < it, 

|A^ n (o;)| < C |log(l + nui + tt/£) - log(l + nu)\ < C'(l + nw)" 1 !" 1 



□ 



Proof of Theorem^ The proof for t = fo llows from Moulines Sz Soulier ( 19991 ) 
(which is a refinement of ( Robinsonl . 1995 . Theorem 2)). We consider the case 
I 7^ 0. Since the function u ^ f{u)E n - k {u)<r Un,jJ is 2vr-periodic and Q-^+^/tn) 



-ilnui 



we may rewrite (j29|) as 



2E[d ni o(wj) d n/( w fc)] 

= r {f{ui)E^ fc (a;) - /(w + 7T/£n)E ndjk {oj + vr/^n)} e" 1 

J— 7T 

= A(n, j,fc) + B{n,j,k) , 



da; 



where E n> j >k {uj) is defined in (i30l) and the two terms A{n,j,k) and B{n,j,k) 
are respectively defined by 



A{n,j,k) 
B{n,j,k) 



{f{u) - f{u + it I In)] E n ^ k (u;)e- i£n "dLO , (36) 
f{u + ir/tn) {E nJtk (w) - E n>jjk {u + ir/ln)} e'^du . (37) 



First consider A n> j k . Denote g+ = hjw) — fAio + ir(£n)). 

We proceed like in the proof of (jRobinsonl . Il995l . Theorem 2). We decompose 
A as the following sum 



A{n,j,k)±ir f 

6 



{/M - f{co + 7T/£n)} E n j !k {u)e 



-\lnu) 



doj 



i=l 

A 

A 



where W±{n,j,k) = {—Uj/2 < uj < Uj/2}, W2{n,j,k) = {ujj/2 < uj < {cjj + 
u k )/2}, W 3 {n,j,k) 4 + u k )/2 < u < 3uj k /2}, W 4 {n,j,k) 4 { 3wfc /2 < uj < 
it}, W 5 {n,j,k) = {-it < uj < -uj k }, and W 6 {n,j,k) = {-uj k < uj < -ujj/2}. 
Note that, for ujq < it, there exists a constant C < oo (depending only on luq) 
such that, for all uj 6 [— ujq,ujq], 



\D n {u)\ < Cn{\ + n\u\)- 1 , 

\E n j ik (uj)\ < Cn{l + n\uj - ujj\)~ l (l + n\u> - u> k \)~ . 



(38) 
(39) 
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For lo G Wi(n, j, k), ([39]) implies that \E n j^{oj)\ < Cnk" 1 ]^ 1 . Using the bound 
), we therefore obtain 



^4\ qi (nJ,k)\l < j^k^n 1 ^ \f{u)-f{u + v/ln)\&u 

J-Uj/2 

< C/*(0) f£-0--^jdr-l k d-l +£ -lj-d-l k d-l + ^ n y v .-d k d-l\ 

For d G [0,1/2), we have f' 1 < j~ d ; for d G [0,1/2), j~ d k d ~ l < 
Therefore, for d G (-1/2, 1/2), 

uo d oo d k \ qi {n,j,k)\ < C/*(0) {r« 1 - M ) Al > + {ln)- v ] j-^k^' 1 . (40) 

By ([33D, for w G [wy/2, (^ + w fc )/2], 

- f(u + 7r/€n)| < C/^Ojn 2 ^- 1 - 2 ^- 1 . 

For lo G W2(n,j, k), we use the bounds \D n {ui — Uk)\ < Cn{l + (fc — i)}" 1 , and 

|D n (w - Uj)\du < C7Iog(Jfe) 



(see (jRobinsonl . Il994l . Lemma 5)). Thus 

<4$ffi(n,j, < C/*(0) t- X r + (k - j)}' 1 log(fc). 

We consider separately the cases j < k < 2j (j and /c are close) and j > 2k (j 
and k are far apart). If j < k < 2j, then j -1 < 2/c" 1 , so that j~ 1 ~ c( A; d {l + (A; — 
j)}" 1 < 2j- d k d ~ l . On the other hand, if k > 2j, j < k/2, (k - < 2k' 1 . 
Therefore, j~ l ~ d k d {l + (k—j)}~ 1 < 2j~ d k d ~ 1 . Combining these two inequalities 
and using, for d < that j~ d k d ~ l < j d k~ d ~ l yields, for d G (-1/2, 1/2), 

LO d LO d k \q 2 (n,j,k)\ < CUW^j-WkW- 1 ^!*) . (41) 

The bound for can be obtained exactly along the same lines. For u G 
Wi(n,j, k), we use (j3T1) which shows that 

\f(u) - f(u + n/£n)\ < C h{Q)r l n 2d k~ l ~ 2d , 

and, by (f39|) . |£^ nj) fc(w)| < Cn _1 (w — w^) -2 , which imply 

l„-d7.-l-d„-l l.. . 

1 -du-2-d 



LoM\q A (n,j, k)\ < Cf^r'fk-^n- 1 / (w - a; fc )^dw 

J3w fc /2 



For d > 0, fk~ 2 ~ d < j' d k d ~ l and for d < 0, j d k- 2 - d < j d k~ d - 1 . Therefore, 
for any dG (-1/2,1/2), 

M\q A (n,j,k)\ < CA(0)r 1 r |d| fc Mhl log(fc) • (42) 
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The bound for q§ can be obtained exactly along the same lines. For to £ 
W 6 (n,j,k), we use the bounds \ f{u) - f{u + w/£n)\ < C f^)l~ l n 2d j~ 1 ~ 2d , 
\D n (u — u>k)\ < Cnk" 1 , and J^^ 2 \D n (u! — ojj)\du < Clog(k), which imply 

u d u; d \q 6 (nj,k)\ < CfM^ l r l ~ d k d ~ l \og{k) . 

For d > 0, j' 1 -* < j' d and for d G (-1/2,0), j-^-d^d-i < jd k -d-l^ which 
implies, for d £ (-1/2, 1/2) that 

ufu d \q 6 (n,j,k)\ < Cf^r'r^k^- 1 . (43) 

By combining the bounds obtained in (|40p . (|4ip . (|42p and (|43|) . we therefore 
obtain the following bound 

ufu d k \A(n,j, k)\ KCfM^-^ + iinyv + r'logik^r^kW- 1 . (44) 

We now consider the second term B(n,j, k) defined in (I37j) . Note that 

En,j,k( u ) ~ E n,j,k(w + n/£n) 

= A ejn (tv - Uj)D n (u - u k ) + D n (ui + 7r/£n - ojj)h &)n (u) - u k ) , (45) 

where D n is the Dirichlet kernel and A^ n is defined in (|34p . In addition, since 
D n (uj) and A£ jn (w) are polynomial in e luJ of degree at most n, for any ^ > 0, 
and any 1 < j, k < [n/2], 

A^w - ujj)D n {uj - uj k )e- ienuj duj = . (46) 

J — n 

Using this identity together with (145p . we may split B(n,j,k) into two terms 
Bi(n,j,k) and B2(n,j,k) which are defined as follows: 

B 2 = {/ (u + - / (w fc + ^ } D n (^ - u^A^u, - c^^e-^dw . 

These two terms can be handled similarly. We consider B\{n,j,k). We decom- 
pose this term as the sum Bi(n,j, k) = Ya=i Qi{ n -,ji k), with 

qi(n,j,k) = 

where the intervals Wi(n,j, k), i = 1, 6 are defined as above. For u; S Wi(n, j, &), 
we have 

+ n/in) - f( Uj + it/ln)\ < CA(0) (|u,|- 2rf + . 
Moreover, Lemma [7] shows that |A^ n (ci-> — uij 

)| < Cl~ x j~ x and dHSl) implies 

\D n {oj — tv k )\ < Cnk~ x . Therefore, 

\qt(n,j, k)\ < CMO)n(jke)- 1 / {\uj\- 2d +ujf d )duj < C f*(0)n 2d j~ 2d k" 1 r 1 . 

J-ujj/2 
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Since for d < 0, j~ d k d ~ 1 < j d k~ d ~ 1 , this bound implies, for any d G (—1/2, 1/2), 

ujui\9y{n,3M < Cf^r^-WkW- 1 . (47) 

Consider now to G W2(n,j,k). We consider separately the case j < k < 2j (j 
and k are close) and k > 2j (j and k are far apart). In the first case (j and k 
close), we use the bounds 

\f{u + ir /In) - f(uj + n/£n)\ < C U^uj 1 ' 2 ^ - Uj \ 

(see (j3Tj) ). | A^ n (u; — Wfc)| < C^ -1 ?!" 1 !^ — Wjl" 1 (Lemma [7]) , and |Z) n (u; — cjfc)| < 
Cn{l + (fe — , which imply that 

r{u k -\-uA/2 

uy k \q 2 (n,j,k)\ < CA(0) W ^{1 + (A; - j)}" 1 ^" 1 / a," 1 " 2 ^ 



<cf*(o)r i j- i - a k a . 

Since j" 1 < 2/c" 1 , j- l ~ d k d < 2j- d k d ~ 1 < 2j^k^~ 1 . Therefore, for j < k < 2j 
and slide (-1/2,1/2), 

u$4\q 2 (n,j, k)\ < CUOy^r^kW- 1 . (48) 

In the second case (j and k far apart), we use the bounds \f(co + ir/£n) — f(ujj + 
n/ln)\ < CU<d){ujf d + ujj 2d ), \D n {oj - oj k )\ < Cnk~\ and 



Mw k +uj J )/2 r(ui k +Wj)/2 

/ \^ n (u-oj-)\duj < Cr 1 / (l+nw) _1 <L; < Cr x n~ x log(fc) . 

Jwj/2 Jujj/2 

This implies that 

^|g a (n,j\*)| < CA^r 1 (j-^- 1 log(fc) 

<C/*(0)r 1 r |d| fc l<i| " 1 log(A:) , (49) 

For uj G W3(n, j, fc), and j < k < 2j, we use the bounds |/(cj + ir /In) — f(uj + 
ir/£n)\ < Cojj l ~ 2d \uj - |A^ n (w - < Ci~ l n~ l \uj - Uj\" x , and 

3w fc /2 

\D n (u-u k )\du < Clog(k) . (50) 

(Uj+U k )/2 

Therefore, since k < 2j, 

ufu d \q 3 (n,j,k)\ < CA(0)log(fc)wri-^ r i n -i < C/^r^-MfcMI- 1 . 

For G W 3 (n,j,k), and j < fc/2, we use the bounds \f(uj + n/ln) — f(uj + 
ir/£n)\ < CU(0) {u^ 2d + uj- 2d ), \A £tn (u - < C^n' 1 ^ 1 , and §0]). 
Therefore, for j < k/2, 

ufut\q 3 (n,j,k)\ < C/*(0) log^^Ar 1 ((fc/j) d + (j/A;) £ 

< c , /*(o)r 1 i-i <, ifci d i- 1 
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For ijj E W4(n,j, k), we use the bounds \f(u> + ir/£n) — f(ujj + 7r/^n)| < 
Cf+(0){uj- 2d + LuJ 2d ), \A ttn (u-Uj)\ < Ce^n" 1 ^ 1 , and \D n (u-u k )\ < Co;" 1 , 
which implies that 

J3u k /2 

<Ch{Q)r l k-\ul 2d + uJ 2d ) . 

Hence, 

u d u d k Mn,j,k)\ < CMO^k- 1 ((k/j) d + (j/k) d ) < CUWi-ij-WkW- 1 . 

(51) 

The bound for q§ can be obtained exactly along the same lines. For to 6 
W 6 (n,j, k), we use the bounds \f(u: + vr/£n) - /(wj + vr/^n)| < Cf±{0)(uj^ 2d + 
JWb(nj,fc) lAi.n^-WjOldw < CtT 1 n~ 1 log(&) and \D n (w-w k )\ < Cnk" 1 , 
which implies that 

u d uj d \q 6 (n,j,k)\ < CUMr 1 (r d k d - 1 +j d k- d - 1 ) log(fc) 

<C7/*(0)r 1 r |d| fc l<f| - 1 log(A;). 
This concludes the proof. □ 



7. Proof of Theorem [4] 



The proof is similar to the one used for the pooled period ogram giv e n in 
Moulines &: Soulier ( 20031 ) (which simplifies the arguments given iRobinsonl (jl995h 
being based on the cen t ral lim it theorem for non-linear functions of Gaussian 
vectors given in ISoulierl(|200lh . The error d 9:n — d is naturally decomposed into 
a stochastic and a bias terms as follows 



dg,n ~d=Y^ a k,n( m n) log [l g ,n(^k) / f (^k)] 



k=l 



+ a kA m n) log [f+(uk)/f* 



(m n ,g) + B n (m n ,f± 



k=l 



where the coefficients {a k ,n( m )} is defined in ([18]) , In the previous expression, 
Sni^n^g) is a stochastic fluctuation term (which depends in particular on the 
number of epochs) and B n (m n , /*) is the bias caused by the approximation in 
the neighborhood of the zero frequency of /* by a constant. The result will follow 
from the weak convergence of the stochastic term S n (rn n , g) and from a bound 
for the bias term B n (m n , /*). By ( Moulines fc Soulier! . 20031 . Lemma 6.1), there 
exists a constant C(Oo, /3,/i) such that, for any /* £ jF*(f2o, £t) and any non- 
negative integer m satisfying 2itm/n < SIq, |J3 n (m, /*)| < C(Qq, (3, /j,)(m/n)P . 



0. 



Therefore, under the condition ([19]) . linin^oo y^m^ 

To establish the weak converge nce result, we use (iMoulines fc Soulierl . I2003L 
Theorem 9.7) (which is based on ([Soulier! . l200ll . Theorem 4.1)). To simplify the 
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nota tions, put ak, n = akn( rn n)- Since lYmn^^m^ 1 +n 1 m 



0, it follows 



from Robinson ( 19951 ) that 

lim m n of, „ = 1/4 and max \a k n \ = O (mZ 1 ' 2 log(?n n ) ) . 
i— >oo ' ' Kk<m„ ' V / 



fe=l 



Set v n = L m nJ f° r some 7 6 (1/2, 1) and g n 
the function 5 is defined in (j 18j) . Note that 



' 1 Er= 1 [-21ogW], where 



max \au n \ V] l a fc,n| log 2 (u n ) < log 2 (m n ) ( max |o fc n | ) ->■ , 

Cfc<m n f — ' \l<fc<m n / 



l<k<m n 
m 



k=l 



\ a k,n\ log(v n )/v n < ml/ 2 7 log(m n )- 



k=l 



mn L Ek=l\-^OS^k)-9n\ 



To apply (jMoulines fc Soulierl . 120031 . Theorem 9.7), we finally need to prove that 
there exists a constant C such that, for all k G {1, ... , m n }, E log 2 (I g ^ n {u)k) / f (uk)) 
C. Corollary [3] shows that this bounds holds for any k € {K, . . . , m n }, where K 
is a fixed integer. For the first K frequencies, we use Theorem Q] to show that, 
for any given k and uniformly in n, the minimal eigenvalue of the covariance 
matrix of the random vector 

[Re{d n ,o(w fc )} ,lm{d nfi (uj k )} , . . . , Re {d n , g _i(u) k )} , Im {dn )fl -i(wjfc)}] 

is bounded away from zero. 



< 
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